tests: enable higher concurrency and adjust tests with outlier runtime #4904

jcsp · 2023-08-04T13:51:48Z

Problem

I spent a few minutes seeing how fast I could get our regression test suite to run on my workstation, for when I want to run a "did I break anything?" smoke test before pushing to CI.

Test runtime was dominated by a couple of tests that each ran for longer than all the other tests combined.
Test concurrency was limited to <16 by the ports-per-worker setting.

There's no "right answer" for how long a test should be, but as a rule of thumb, no one test should run for much longer than the time it takes to run all the other tests together.
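As a rough illustration of that rule of thumb, the sketch below flags tests whose own runtime exceeds an estimate of the wall-clock time needed for everything else. The function name, the `slack` parameter, and the sample durations are hypothetical; real numbers could come from something like pytest's `--durations` report. Dividing the remaining runtime by the worker count is only a crude estimate of parallel wall-clock time.

```python
def outlier_tests(durations: dict[str, float], workers: int, slack: float = 1.0) -> list[str]:
    """Return names of tests that run longer than all the other tests together.

    durations: per-test runtime in seconds (hypothetical input format).
    workers:   number of parallel test workers.
    slack:     multiplier on the estimate; values above 1.0 make the check more lenient.
    """
    if workers < 1:
        raise ValueError("workers must be at least 1")
    total = sum(durations.values())
    flagged = []
    for name, secs in durations.items():
        # Crude estimate: the other tests spread evenly over all workers.
        rest_wall_clock = (total - secs) / workers
        if secs > slack * rest_wall_clock:
            flagged.append(name)
    return sorted(flagged)


# Made-up durations, purely for illustration.
sample = {"test_a": 10.0, "test_b": 20.0, "test_c": 300.0}
flagged = outlier_tests(sample, workers=2)
```

With the made-up durations above, only the 300-second test is flagged, which is the kind of test that would be a candidate for a shorter runtime or fewer iterations.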

Summary of changes

Make the ports per worker setting dynamic depending on worker count
Modify the longest running tests to run for a shorter time (test_duplicate_layers which uses a pgbench runtime) or fewer iterations (test_restarts_frequent_checkpoints).

Checklist before requesting a review

I have performed a self-review of my code.
If it is a core feature, I have added thorough tests.
Do we need to implement analytics? If so, did you add the relevant metrics to the dashboard?
If this PR requires public announcement, mark it with the /release-notes label and add several sentences in this section.

Checklist before merging

Do not forget to reformat commit message to not include the above checklist

The overall suite runs comfortably within about 4 minutes when these tests are modified; previously the runtime was about 10 minutes, most of which was spent waiting for these 2 tests to finish.

The ports-per-worker setting was previously at 1000, having been bumped from an earlier value of 100. This limited concurrency on systems with larger numbers of cores, and none of the tests we have today require more than 384 ports.
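For readers who want a picture of what a dynamic ports-per-worker setting could look like, here is a minimal sketch that divides a fixed port range evenly across workers. It is not the code from this PR: `BASE_PORT`, `MAX_PORT`, and both function names are assumptions chosen for illustration.

```python
# Hypothetical bounds of the port range shared by all test workers.
# These values are assumptions for illustration, not the project's constants.
BASE_PORT = 15000
MAX_PORT = 32768  # exclusive upper bound


def ports_per_worker(worker_count: int) -> int:
    """Split the available port range evenly across the workers."""
    if worker_count < 1:
        raise ValueError("worker_count must be at least 1")
    return (MAX_PORT - BASE_PORT) // worker_count


def worker_port_range(worker_index: int, worker_count: int) -> range:
    """Return the block of ports owned by one worker (0-based index)."""
    if not 0 <= worker_index < worker_count:
        raise ValueError("worker_index out of range")
    size = ports_per_worker(worker_count)
    start = BASE_PORT + worker_index * size
    return range(start, start + size)
```

Under these assumed bounds, 32 workers would each get 555 ports, which stays above the 384-port figure mentioned above, while a single worker would get the whole range. A fixed per-worker value, by contrast, caps the worker count at the range size divided by that value.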

github-actions · 2023-08-04T14:13:03Z

1264 tests run: 1211 passed, 0 failed, 53 skipped (full report)

petuhovskiy · 2023-08-04T15:29:38Z

test_restarts_frequent_checkpoints and the similar test_restarts_under_load were useful before, when we had bugs, but I haven't seen them fail for a long time now. +1 for decreasing iterations.

test_runner/fixtures/neon_fixtures.py

bayandin

Looks good!

Don't forget to update "Summary of changes" in the description 😉

jcsp added 2 commits August 4, 2023 14:48

tests: reduce ports per worker to 384

367ffff

This was previously at 1000, having been bumped from an earlier value of 100. This limited concurrency on systems with larger numbers of cores, and none of the tests we have today require more than 384 ports.

jcsp added the a/test (Area: related to testing) and a/tech_debt (Area: related to tech debt) labels Aug 4, 2023

jcsp marked this pull request as ready for review August 4, 2023 15:20

vadim2404 requested a review from bayandin August 4, 2023 15:25

bayandin reviewed Aug 7, 2023


test_runner/fixtures/neon_fixtures.py (outdated, resolved)

make per-worker port count dynamic

f4cd80f

bayandin approved these changes Aug 7, 2023


jcsp merged commit 33cb1e9 into main Aug 8, 2023

jcsp deleted the jcsp/faster-tests branch August 8, 2023 08:16
